A Hierarchical IRT Model for Criterion-Referenced Measurement
Authors
Abstract
A hierarchical IRT model is proposed for mastery classification in criterion-referenced measurement. In this model, items measuring the same criterion are grouped, and a difficulty parameter and a discrimination parameter of the criterion are estimated on the same scale as the person and item parameters. The level of proficiency of a student with respect to the criterion is determined by the probability of success on the criterion. Cutoff points on the probability scale can be used to classify respondents into masters and nonmasters. The hierarchical IRT model is estimated using the Gibbs sampler and tested using posterior predictive checks. The model is illustrated with a test measuring the attainment targets of reading comprehension (in Dutch) at the end of primary education.
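As a rough illustration of the classification step described in the abstract, the sketch below computes a probability of success on a criterion and compares it with a cutoff. It is not the authors' implementation. The two-parameter logistic link, the function names, and the cutoff value of 0.8 are all assumptions made for the example; the abstract does not state the link function or any specific cutoff.

```python
import math


def criterion_success_probability(theta, difficulty, discrimination):
    """Probability of success on a criterion for a person with ability theta.

    Assumption: a two-parameter logistic form. The abstract only says the
    criterion has a difficulty and a discrimination parameter on the same
    scale as the person parameter; it does not specify the link function.
    """
    return 1.0 / (1.0 + math.exp(-discrimination * (theta - difficulty)))


def classify(theta, difficulty, discrimination, cutoff=0.8):
    """Label a respondent by comparing the success probability to a cutoff.

    The default cutoff of 0.8 is a hypothetical value chosen for illustration.
    """
    p = criterion_success_probability(theta, difficulty, discrimination)
    return "master" if p >= cutoff else "nonmaster"


if __name__ == "__main__":
    # Hypothetical criterion parameters and person abilities.
    for theta in (-1.0, 0.0, 2.0):
        p = criterion_success_probability(theta, difficulty=0.0, discrimination=1.0)
        print(theta, round(p, 3), classify(theta, 0.0, 1.0))
```

In a fully Bayesian treatment such as the one the abstract mentions, the same comparison would presumably be applied to posterior draws of the parameters rather than to fixed point values.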
Similar resources
Marginal True-Score Measures and Reliability for Binary Items as a Function of Their IRT Parameters
This article provides analytic evaluations of population true-score measures for binary items given their item response theory (IRT) calibration. Under the assumption of normal trait distribution, the expected values of marginalized true scores, error variance, true score variance, and reliability for norm-referenced and criterion-referenced interpretations are presented as a function of the it...
Measurement Error in Hierarchical Gain Score Modeling
This paper compares three approaches for solving the problem of measurement error in a hierarchical gain score model. The pre-test and post-test scores are IRT scores with measurement error. Explanatory variables at student level and class level are considered in the model. Simulation results show that the gain score model that does not consider measurement error overestimates the explanatory v...
MEASURING AND DETECTING DIFFERENTIAL ITEM FUNCTIONING IN CRITERION-REFERENCED LICENSING TEST A Theoretic Comparison of Methods
The validity of a measurement instrument depends on the quality of the items included in the instrument. The overall aim was to compare methods for detecting and measuring differential item functioning, DIF, in order to find a suitable method for examining DIF in a dichotomously scored criterion-referenced licensing test. The methods were discussed with respect to whether they are parametric, t...
Multilevel IRT Modeling in Practice with the Package mlirt
Variance component models are generally accepted for the analysis of hierarchical structured data. A shortcoming is that outcome variables are still treated as measured without an error. Unreliable variables produce biases in the estimates of the other model parameters. The variability of the relationships across groups and the group-effects on individuals’ outcomes differ substantially when ta...
Bridging the semantic gap for software effort estimation by hierarchical feature selection techniques
Software project management is one of the significant activities in the software development process. Software Development Effort Estimation (SDEE) is a challenging task in software project management. SDEE has been practiced in the computer industry since the 1940s and has been reviewed several times. An SDEE model is appropriate if it provides the accuracy and confidence simultaneously before softwa...